DESCRIPTION 

rt 

METHOD OF PRODUCING TEMPLATE DNA AND METHOD OF PRODUCING 
PROTEIN IN CELL- FREE PROTEIN SYNTHESIS SYSTEM USING THE SAME 

5 

Technic al Field 

The present invention relates to a method of producing 
a template DNA for protein synthesis and a method of producing 
a protein using a template DNA that is produced by the method. 

10 Background Art 

Regarding a huge number of genes derived from a great 
deal of genomic sequence information of organisms, a research 
called "Structural Genomics" is now in progress, which is a 
systematic and comprehensive analysis of the relationship 

15 between structure and function of proteins by determining three 
dimensional structures of the proteins encoded by the 
respective genes. As the total genome sequences were 
clarified, it was found that three-dimensional protein 
structures considered to be innumerable actually comprise 

20 combinations of one to several thousands of basic structures 
(folds) and these combinations seem likely to realize the 
diversity of functions. Accordingly, high- throughput 
technologies throughout the processes from synthesis to 
structural analysis of proteins will make it possible to reveal 

25 all of the basic structures of the proteins and based on the 
knowledge of the basic structures, it will be enabled to 
elucidate the relationship between structure and function of 
the proteins. 

As a system for expressing and preparing a number of 
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protein samples at good efficiency, cell-free protein 
synthesis systems have been improved by various modifications 
such as introduction of dialysis, to obtain proteins in the 
order of milligrams within several hours (refer to Kigawa et 
5 al . , FEBS Lett. , vol. 442, pp. 15-19, 1999 and Japanese Patent 
Kokai publication JP-A-2000-175695) . To express proteins in 
this cell-free protein synthesis system at good efficiency, 
a double -stranded DNA containing an appropriate expression 
regulatory region and a gene sequence coding for a target 

10 protein to be expressed is needed as a template DNA. In order 
to express any genes cloned in a cloning vector free of an 
appropriate expression regulatory sequence in a cell -free 
protein synthesis system, it is necessary to add an appropriate 
expression regulatory sequence to these genes. Therefore, 

15 several methods have been so far performed, in which a desired 
DNA fragment is excised from a plasmid vector comprising a gene 
by restriction enzymes or PCR amplification, then re-cloned 
in a vector having an appropriate expression regulatory 
sequence, or a desired DNA fragment was further amplified by 

2 0 PCR. 

To obtain a highly efficient protein expression, it is 
required to promote the transcription of the gene with strong 
promoter or terminator sequence as well as to enhance the 
translation by improving the affinity between mRNA and 
25 ribosomes . Further, for quickly purifying or detecting 
synthesized proteins, it is also required to design fusion 
proteins having incorporated therein a tag sequence for 
affinity purification or detection. 

When the template DNA suitable for such protein synthesis 
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is prepared, however, much expense in time and effort is 
necessary to clone the DNA by genetic engineering techniques 
using living cells such as E. coli, because the methods are 
complicated and difficult to make high- throughput by 
5 automation. It is also problems to construct the template DNA 
by complex recombination, or to synthesize many kinds of 
primers for PCR for the optimization of respective genes. 
Disclosure of the Invention 

It is therefore an object of the present invention to 

10 provide an efficient method of preparing a template DNA to 
express and purify a protein. 

In view of the foregoing aim, the present inventors have 
assiduously conducted investigations on a method of producing 
a template DNA in a cell- free protein synthesis system, and 

15 have consequently found that a linear template DNA appropriate 
for expression of a protein can be synthesized very quickly 
and efficiently by amplifying an optional cloned DNA selected 
from cDNA library through 2 -step PCR. This finding has led 
to the completion of the invention. 

20 In the first aspect of the present invention, there is 

provided a method of producing a template DNA used for protein 
synthesis comprising a step of: 

amplifying a linear double- stranded DNA by polymerase 
chain reaction (PCR) , using a reaction solution comprising, 

25 a first double- stranded DNA fragment comprising a 

sequence coding for a protein or a portion thereof, 

a second double- stranded DNA fragment comprising a 
sequence overlapping with the 5* terminal region of the first 
DNA fragment, 
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a third double- stranded DNA fragment comprising a 
sequence overlapping with the 3 1 terminal region of the first 
DNA fragment, 

a sense primer which anneals with the 5' terminal region 
5 of the second DNA fragment, and 

an anti-sense primer which anneals with the 3' terminal 
region of the third DNA fragment, 

wherein the second DNA fragment comprises a regulatory 
sequence for transcription and translation of a gene, and the 
10 concentrations of the second DNA fragment and the third DNA 
fragment in the reaction solution each range from 5 to 2,500 
pmol/L . 

In a preferred embodiment, the invention is 
characterized in that the reaction solution (second PCR 

15 solution) comprises first PCR products obtained by polymerase 
chain reaction (first PCR) to amplify the firs t double- stranded 
DNA fragment, and the respective concentrations of primers 
remaining in the first PCR products and primer dimers produced 
in the first PCR are less than 20 nmol/L in the second PCR 

20 solution. 

Thus, the invention is characterized in that for 
decreasing the amounts, incorporated into the second PCR, of 
primers used in the first PCR and primer dimers produced as 
byproducts, the concentrations of the primers used for the 
25 first PCR are in the range of from 20 to 500 nmol/L, or the 
second PCR is performed with the first PCR products which are 
diluted to 10- to 100-fold (at a final concentration in the 
second PCR solution) , or both of these are performed in 
combination, or the second PCR is performed after removing the 
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primers and the primer dimers from the first PCR solution. 

In one embodiment, the invention is characterized in that 
the first PCR is carried out using recombinant microorganisms 
or a culture broth thereof comprising the first double- stranded 
5 DNA fragment. 

In another preferred embodiment, the invention is 
characterized in that the sense primer and the anti-sense 
primer have the same nucleotide sequence, and/or the second 
DNA fragment comprises a transcription termination sequence. 
10 In the other preferred embodiment, the invention is 

characterized in that at least one of the second DNA fragment 
and the third DNA fragment comprises a sequence coding for a 
tag peptide, and the tag peptide is synthesized by being fused 
with the protein or a portion thereof. The tag peptide is 
15 preferably maltose binding protein, cellulose binding domain, 
glutathione-S- transferase, thioredoxin, streptavidin binding 
peptide or histidine tag peptide. 

The invention provides, in another aspect, a method of 
producing a protein in a cell-free protein synthesis system, 
20 characterized by using a template DNA that is produced by any 
of the foregoing methods. 
Brief Description of the Drawings 

Fig. 1 is a flow chart showing a method of producing a 
linear template DNA, which is one embodiment of the present 
25 invention. 

Fig. 2 is a photograph analyzed by agarose gel 
electrophoresis of samples obtained by PCR amplification of 
template DNAs coding for Ras and CAT proteins fused with various 
tag peptides according to the method of the invention. 



5 



Fig. 3 is a graph showing synthesized amounts of Ras and 
CAT proteins fused with various tag peptides. 

Fig. 4 is a photograph analyzed by SDS-PAGE of Ras and 
CAT proteins fused with various tag peptides. 
5 Fig. 5 is a graph showing synthesized amounts of proteins 

derived from mouse cDNA clones synthesized by fusion with GST 
and native His tag sequences. 
preferred Embodim ents of the Invention 

One embodiment of the present invention is described in 
10 detail below along the attached drawings. Fig. 1 is a flow 
chart showing a method of producing a linear double- stranded 
template DNA for protein synthesis by 2-step PCR using a plasmid 
DNA with a full-length cDNA cloned. 

(1) First PCR 

15 With respect to the template DNA of the first PCR, any 

template DNA will be used, so long as it comprises a sequence 
coding for a desired protein. For example, not only a DNA 
cloned from cDNA library but also a genomic DNA clone or a 
synthetic DNA can be used. Further, an unpurif ied DNA is also 

20 available. For example, a strain carrying a plasmid vector 
comprising the foregoing sequence or a culture broth of the 
strain can be used. Fig. 1 shows modified bluescript 1 which 
is a plasmid obtained by cloning mouse full-length cDNA in Sad 
and Xhol sites of plasmid vector pBLUESCRIPT SK+ . 

25 In Fig. 1, two primers (5 1 -primer 1 and 5 1 -primer 2 ) are 

used as 5 1 -primers for amplifying a protein code region of a 
cloned cDNA. A nucleotide sequence of the 5 ■ -primer 2 is 
designed to anneal with a nucleotide sequence corresponding 
to an amino terminal region of the protein encoded by the cloned 
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cDNA. The 5 1 -primer 2 can further contain an arbitrary 
nucleotide sequence in the 5 1 -terminal side. Meanwhile, the 
5 1 -primer 1 may have a sequence common to a part of the 
5* -terminal side of the 5 1 -primer 2, and both of the primers 
5 overlap to function as the 5 ' -primers of the first PCR. 

On the other hand, in Fig. 1, the 3 1 -primer is designed 
to anneal with a region of the plasmid vector where is located 
in the 3 1 downstream of the protein coding region, and elongates 
the DNA chain by PCR together with the 5 1 -primers to amplify 

10 the first DNA fragment coding for the protein to be expressed. 
Since the first DNA fragment is used as a template of the second 
PCR to be described later, it can be purified from the PCR 
reaction solution by a known method, or the PCR reaction 
solution can also be used directly. In this case, the primers 

15 remaining in the reaction solution might inhibit the second 
PCR reaction. For this reason, it is preferable that the 
amounts, incorporated into the second PCR solution, of primers 
used in the first PCR and primer dimers produced as byproducts 
in the first PCR are decreased to less than 20 nmol/L. 

20 Specifically, when plural primers are used as the 5 1 -primer 
or the 3' -primer, they are preferably used at a total 
concentration of from 20 to 500 nmol/L, more preferably from 
50 to 100 nmol/L. 

Alternatively, the amounts, incorporated into the 

25 second PCR, of the primers and the primer dimers can be reduced 
by diluting the first PCR solution to 10- to 100-fold. 

Besides such methods, PCR using various primers is 
possible for obtaining the first DNA fragment. With respect 
to nucleotide sequences of primers, PCR reaction conditions 
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and the like employed at that time, ordinary ones are available, 
and it is desirable to properly determine them by those skilled 
in the art. It is further possible to obtain the first DNA 
fragment by excising a DNA fragment directly from the cDNA clone 
5 with a restriction enzyme or the like. 
(2) Second PGR 

Next, the double- stranded second DNA fragment 
(hereinafter sometimes referred to as " 5 • DNA fragment") 
comprising a sequence overlapping with the 5' terminal region 

10 of the first DNA fragment is prepared. The second DNA fragment 
overlaps with the 5 1 terminal region of the first DNA fragment 
in its 3 1 terminal region, whereby these two DNA fragments 
mutually become primers and a template to allow an elongation 
reaction, producing a template DNA in the second PCR. The 

15 overlapping region is preferably at least 12 base pairs, more 
preferably at least 17 bases. 

Subsequently, the double- stranded third DNA fragment 
(hereinafter sometimes referred to as "3* DNA fragment") 
comprising a sequence overlapping with the 3' terminal region 

20 of the first DNA fragment is produced. The third DNA fragment 
overlaps with the 3 1 terminal region of the first DNA fragment 
in its 5 1 terminal region, whereby these two DNA fragments 
mutually become primers and a template to allow an elongation 
reaction, producing a template DNA in the second PCR. The 

25 overlapping region is preferably at least 12 base pairs, more 
preferably at least 17 bases. 

Then, as the primers for the second PCR, the sense primer 
which anneals with the 5 1 terminal region of the 5 1 DNA fragment 
and the anti-sense primer which anneals with the 3' terminal 



region of the 3' DNA fragment are synthesized. These primers 
may anneal with the template DNA of the second PCR to allow 
a DNA elongation reaction and amplify a desired DNA fragment. 
Plural primers that anneal with the both terminal regions of 
5 the template DNA can be used. Preferably, when the both 
terminal sequences are complementary nucleotide sequences, an 
amplification reaction is possible with one type of a primer 
(universal primer) as shown in Fig. 1. This primer usually 
comprises a single- stranded oligonucleotide of from 5 to 50 

10 bases, preferably a single- stranded oligonucleotide of from 
15 to 25 bases. The use of one type of a universal primer is 
more advantageous than the use of two types of primers because 
byproducts are less formed by PCR. 

Generally, it is known that byproducts (primer dimers) 

15 in which primers are paired are formed by PCR. In case of using 
two types of primers, primer dimers once formed serve as a 
template DNA to amplify the byproducts and decrease amounts 
of primers used for amplification of a desired product. 
Consequently, an amount of a desired product is considered to 

20 be decreased. Meanwhile, when PCR is performed using only one 
type of a primer, it is considered that the resulting primer 
dimers tend to take a hairpin structure because they have 
complementary sequences in the same molecule, and are therefore 
less amplified by PCR to less form byproducts. 

25 The 5' DNA fragment contains sequences of inducing and 

regulating transcription and translation of a gene in the 
upstream of the region overlapping with the first DNA fragment . 
The sequences of inducing and regulating the transcription of 
the gene are called promoter and operator sequences, and they 
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have been studied in detail in prokaryotic cells and eukaryotic 
cells such as Escherichia coli and yeasts. For example, T7 
promoter derived from a phage of Escherichia coli is used. it 
is known that in T7 promoter, potent transcription is performed 
5 with T7 RNA polymerase. Translation of an mRNA into a protein 
is induced by binding a translation initiation complex of a 
ribosome or the like to an mRNA. A ribosome binding sequence 
(RBS) is called an SD sequence, and important for efficient 
expression of a protein. 

10 Further, these expression regulatory sequences contain, 

in the upstream, the sequence with which the sense primer 
anneals in PCR. The nucleotide sequence of the sense primer 
here may be one which serves as a primer by being hybridized 
with the 5' terminal region of the 5* DNA fragment, and it 

15 includes one in which one or more bases in the sequence 
complementary to the template DNA are deleted, substituted or 
added . 

Likewise, the 3' DNA fragment contains, in the 3' 
terminal region, the sequence with which the anti -sense primer 

2 0 anneals in PCR. The anti -sense primer may be one which serves 
as a primer by being hybridized with the 3' terminal region 
of the 3' DNA fragment, and it includes one in which one or 
more bases in the sequence complementary to the template DNA 
are deleted, substituted or added. 

25 The concentrations of the 5' DNA fragment and the 3' DNA 

fragment in the reaction solution of the second PCR are 
preferably lower than the concentrations of primer DNAs in 
usual PCR, and they are used at concentrations of from 5 to 
2,500 pmol/L, preferably from 10 to 500 pmol/L. When the 
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concentrations of these DNA fragments are higher than 2,500 
pmol/L, byproducts tend to be formed during PCR. That is, 
single- stranded DNAs produced from the universal primer bound 
to the 5' DNA fragment and the 3* DNA fragment are bound in 
5 the same manner as the foregoing primer dimers, with the result 
that byproducts in which the 5' DNA fragment and the 3' DNA 
fragment are directly bound tend to be formed. The reason is 
that since these byproducts are shorter than the desired 
products, they tend to be amplified by PCR, act as a template 

10 that expresses only short proteins in the protein synthesis, 
and have therefore an adverse effect on the protein synthesis. 

Meanwhile, when the concentrations of the DNA fragments 
are lower than 5 pmol/L, the amount of a DNA that serves as 
a template itself is decreased, and a sufficient amount of a 

15 desired DNA is not amplified. The other reaction conditions 
of the second PCR can properly be selected and used by those 
skilled in the art. 

It is preferable that the 3* DNA fragment further 
comprises a transcription termination sequence. The 

20 transcription termination sequence is a DNA sequence of 
prompting separation of an RNA polymerase, and it has generally 
a characteristic structure of a symmetrical sequence enriched 
in GC and followed by T to allow an efficient transcription 
reaction . 

25 The second PCR can be performed even with one or both 

of the 5' DNA fragment and the 3' DNA fragment being 
single-stranded DNA(s) instead of the double- stranded DNA(s) , 
and this embodiment is also included in the scope of the 
invention . 
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(3) Tag peptide 

It is preferable that at least one of the 5* DNA fragment 
and the 3' DNA comprises a sequence coding for a tag peptide. 
The tag peptide is an amino acid sequence added to an N terminal 
5 and/or a C terminal of a protein to be expressed, and this is 
a sequence as a mark in affinity purification or western 
blotting detection of the protein. Examples thereof include 
glutathione-S- transferase (GST) , maltose binding protein 
(MBP) , thioredoxin (TrxA) , cellulose binding domain (CBD) , 

10 streptavidin binding peptide (for example, Streptag 
(trademark)) and histidine tag peptide. 

Glutathione-S- transferase (GST) is a soluble enzyme 
protein. When a desired gene is incorporated in the downstream 
of this gene sequence by adapting the frame thereto, a fusion 

15 protein with GST can be expressed. Recombinant vectors, pGEX 
Vectors for this purpose are commercially available from 
Amersham Pharmacia Biotech. It is used for affinity 
purification or enzymatic immunostaining by utilizing an 
antibody that specifically recognizes a protein portion of GST 

20 or a property of binding to glutathione. 

The maltose binding protein (MBP) is maltose binding 
protein of Escherichia coli . A fusion protein with MBP can 
be adsorbed on amylose or agarose gel, then separated with 
excess maltose and purified. Further, anti-MBP antibody can 

25 also be used. 

Thioredoxin (TrxA) is a protein of Escherichia coli that 
catalyzes a redox reaction, and can be purified by metal chelate 
affinity chromatography with a pair of functional thiol groups . 
As a carrier for this purpose, for example, ThioBond 
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(trademark) Resin (manufactured by Invitrogen) is 
commercially available. 

The cellulose binding domain (CBD) is a cellulose binding 
domain sequence derived from Clostridium cellulovorans and 
5 Cellulomonasfini, and has a property of specifically binding 
to cellulose. It can be immobilized on an inert carrier such 
as cellulose or chitin without chemical modification. 

As the streptavidin binding peptide, for example, a 
peptide made of 8 amino acids, called Strep- tag II, is known, 

10 and it can be purified by being bound selectively to StrepTactin 
(trade mark) or Streptavidine (trademark). 

As the histidine tag, a peptide comprising at least 6 
histidines which are located continuously or in close proximity 
is preferable. The protein encoded by the first DNA fragment 

15 is bound to this histidine tag either directly or through an 
amino acid sequence. The histidine tag has a high affinity 
for a divalent metallic atom, especially, a nickel atom. 
Accordingly, the protein having the histidine tag is firmly 
bound to a nickel affinity matrix and can easily be purified. 

20 In an especially preferable embodiment, the histidine 

tag has an amino acid sequence described in SEQ ID No. 1. This 
sequence is a natural sequence (native His tag) derived from 
an N terminal of a lactic acid hydrogenase of a chicken muscle. 
Since the 6 histidine residues being basic amino acids are 

25 present by being properly separated, this is advantageously 
lower in isoelectric point than the histidine tag comprising 
the continuous 6 histidine residues and enables affinity 
purification under neutral buffer conditions. 

In this embodiment, these tag peptides are useful for 
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affinity purification or detection of the expressed protein. 
In addition, the fusion protein with the tag peptide is 
sometimes increased in expression amount in comparison to 
natural proteins. The reason is, though not clear, presumably 
5 that a complex with an mRNA, ribosome or the like is stabilized 
at the translation initiation stage to increase translation 
efficiency . 

These tag sequences may contain, in the downstream 
thereof, a nucleotide sequence that can provide a cleavage site 

10 of a protease such as thrombin, Factor Xa or enterokinase , and 
a desired protein can be purified by being separated from the 
tag sequences. 

Further, it is also possible that the tag sequences are 
located on the C terminal side of the desired protein and 

15 purification can easily be conducted while a function of a 
leader sequence of the N terminal side remains. 

(4) Protein synthesis using a cell-free protein 
synthesis system 

The thus-produced template DNA can synthesize a protein 

20 by various methods. For example, it is preferable that a 
protein is synthesized in a cell - free protein synthesis system . 
The cell-free protein synthesis system is a system that 
synthesizes a protein in vitro using a cell extract. As the 
cell extract, an extract of hitherto-known eukaryotic cells 

25 or prokaryotic cells, containing ingredients necessary for 
protein synthesis, such as ribosome and t-RNA, can be used. 
A cell extract derived from wheat germ, Escherichia coli (for 
example, E. coli S30 cell extract) or Thermus thermophilics is 
preferable because a high synthesis amount is obtained. This 
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E . coli S30 cell extract can be produced from E . coli A19 (rna 
met) , BL21, BL21 star, BL21 codon plus or the like according 
to a known method (refer to Pratt. J. M. et al . , Transcription 
and translation - a practical approach, (1984), pp. 179-209, 
5 compiled by Henes , B. D. andHiggins, S. J., IRL Press, Oxford), 
or a product marketed from Promega, Novagen or Roche may be 
used. 

Any of a batchwise method, a flow method and other known 
techniques can be applied to the cell-free protein synthesis 

10 system of the invention. Examples thereof can include a 
ultrafiltration membrane method, a dialysis membrane method, 
and a column chromatography method in which a translation 
template is fixed on a resin (refer to Spirin, A. et al . , Meth. 
In Enzymol . vol. 217, pp. 123 - 142, 1993). 

15 Besides the cell-free protein synthesis system, it is 

also possible that, for example, a template DNA is introduced 
into animal cells through lipofection or the like to express 
a gene, and a function of the gene product in viable cells is 
analyzed. 

20 Examples 

In the following Examples of the present invention, the 
results of expressing, in a cell - free protein synthesis system, 
human c-Ha-Ras protein, chloramphenicol acetyl transferase 
(CAT) and various cDNA clones arbitrarily selected from mouse 
25 full-length cDNA library (provided from Dr. Hayashizaki 
Yoshihide, Genome Exploration Research Group, Genomic 
Sciences Center, RIKEN) are described in detail below. 
However, the invention is not limited to these Examples. 
[Example 1] Expression of Ras protein 
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(1) First PCR 

PCR was performed using a linear double- stranded DNA 
(nucleotide sequence was described in SEQ ID No. 2) coding for 
Ras protein as a template and three types of primers, 5 1 primer 
5 1: hRBS>B+6 (SEQ ID. No. 3), 5' primer 2: pRas (SEQ ID No. 4) 
and 3' primer: p8 . 2 (SEQ ID No. 5) . The composition of the 
PCR solution and the amplification program were shown in Tables 
1 and 2 respectively. 

5' primer 1: hRBS>B+6 (SEQ ID No. 3) 
10 5 1 - CCGAAGGAGCCGCCACCAT - 3 1 

5 1 primer 2: pRas (SEQ ID No . 4) 

5 1 - GAAGGAGCCGCCACCATGACCGAATACAAACTGGTTGTAG - 3 1 
3' PRIMER: p8 . 2 (SEQ ID No. 5) 

5 1 - GCGGATAACAATTTCACACAGGAAAC - 3 1 



15 [Table 1] 



Composition 


Concentration 


Amount 


Final 
concentration 


Template plasmid 


1 ng/nl 


4 ul 


0.2 ng/Ml 


5' Primer 1 (hRBS>B+6) 


1.6 mM 


i Ml 


0.08 mM 


5' Primer 2 


0.2 (aM 


2 JLXl 


0.02 mM 


3' Primer (p8.2) 


2 nM 


1 Ml 


0.1 mM 


dNTPs (Toyobo) 


2 mM 


2 M> 


0.2 mM 


Expand HF buffer (Boehringer Mannheim) 


(10x) 


2 Ml 


(1x) 


(containing 15 mM magnesium chloride) 








Sterile distilled water 




7.85 Ml 




DNA polymerase (Boehringer Mannheim) 


3.5 U/ul 


0.15 Ml 


0.02625 U/mI 


Total amount 




20 Ml 
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[Table 2] 



STEP 1 


94°C 


2 min 


STEP 2 


94°C 


30 sec 


STEP 3 


60°C 


30 sec 


STEP 4 


72°C 


2 min 


STEP 5 


GOTO 2 for 9 times 




STEP 6 


94°C 


30 sec 


STEP 7 


60°C 


30 sec 


STEP 8 


72°C 


2 min + 5 sec/cycle 


STEP 9 


GOTO 6 for 19 times 




STEP 10 


72°C 


7 min 


STEP 11 


4°C 


forever 



(2) Second PCR 

Subsequently, the second overlap PCR was performed using 
5 the first PCR products obtained in the foregoing reaction, 5' 
DNA fragment: T7P (one of SEQ ID Nos . 6 to 10) having respective 
tag sequence in the downstream of the T7 promoter sequence, 
3' DNA fragment: T7T (SEQ ID No. 11) having a T7 terminator 
sequence and universal primer (YA1.2): 

10 5 1 -GCCGCTGTCCTCGTTCCCAGCC-3 1 (SEQ ID No. 12). The 
composition of the PCR solution and the amplification program 
were shown in Tables 3 and 2 respectively. Further, the 
outline of the tag peptides encoded in the 5 1 DNA fragment used 
here was shown in Table 4. Consequently, as shown in Fig. 1, 

15 a linear double- stranded DNA fragment capable of expressing 
a fusion protein of various tag sequences and Ras protein was 
amplified under control of the T7 promoter. 
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[Table 3] 



Composition 


Concentration 


Amount 


Final concentration 


First PCR product (template) 


(x 1/5) 


5 pi 


(x 1/20) 


Universal primer (YA1 . 2) 


100 mM 


0.2 ul 


1 uM 


5' Fragment (T7P fragment) 


2 nM 


1 ul 


0.1 nM 


3' Fragment (T7T fragment) 


2 nM 


1 ul 


0.1 nM 


dNTPs (Toyobo) 


2 mM 


2 ul 


0.2 mM 


Expand HF buffer (Boehringer Mannheim) 


(10x) 


2 ul 


(1x) 


(containing 15 mM magnesium chloride) 








Sterile distilled water 




8.65 ul 




DNA polymerase (Boehringer Mannheim) 


3.5 U/ul 


0.15 ul 


0.02625 U/ul 


Total amount 




20 ul 





[Table 4] 



1 


GST 


glutathione-S-transf erase 


2 


MBP 


maltose binding protein 


3 


TrxA 


thioredoxin 


4 


CBD 


cellulose binding protein 


5 


His tag 


MKGSSHHHHHH 


6 


native His tag 


MKDHLIHNVHKEEHAHAHNK 



5 By the way, with respect to His tag and native His tag, 

amino acid sequences thereof were shown by one- letter 
abbreviations. 

(3) Protein synthesis in a cell-free protein synthesis 

system 

10 E . coli S30 extract was prepared from E . coli BL21 strain 

according to a method of Zubay et al . (Annu. Rev. Geneti . , 7, 
267-287, 1973). With respect to the protein synthesis 
reaction, Ras protein was synthesized at 37°C for 1 hour using 
a 96 -well microplate by adding 1 Jul of the second PCR products 

15 and 7.2 pi of the E. coli S30 extract to a solution of a 
composition shown in Table 5 below in each well thereof to 
adjust the whole volume of the reaction solution to 30 pi. 
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[Table 5] 



Composition 



Concentration 



HEPES-KOH pH 7.5 

Dithiothreitol 

ATP 

CTP, GTP, UTP 
Creatine phosphate 
Creatine kinase 
Polyethylene glycol 8000 
3\5'-cAMP 

L(-)-5-formyl-5,6,7,8-tetrahydroforic acid 

E. coli total tRNA 

Potassium glutamate 

Ammonium acetate 

Magnesium acetate 

Amino acids 

T7 RNA polymerase (Toyobo) 



0.9 mM each 
81.0mM 
250.0 |ug/ml 
4.0% 



1.5 mM each 
16.0 units/iul 



1 70.0 |ug/ml 



0.64 mM 
35.0 jug/ml 



200.0 mM 
27.7 mM 
10.7 mM 



58.0 mM 
2.3 mM 
1.2 mM 



[Example 2] Expression of CAT protein 

(1) First PCR 

The first PCR was performed in the same manner as in 
Example 1 using, as a template, a linear double- stranded DNA 
having a nucleotide sequence described in SEQ ID No. 13 which 
DNA codes for CAT protein. In this connection, primer pCAT 
specific for CAT gene and having a nucleotide sequence 
described in SEQ ID No. 14 was used as 5' primer 2. 

5 f primer 2: pCAT (SEQ ID No. 14) 

5 " - GAAGGAGCCGCCACCATGGAGAAAAAAATCACTGGATATAC - 3 ' 

(2) Second PCR and protein synthesis in a cell- free 
protein synthesis system 

The protein was synthesized in a cell- free protein 
synthesis system after second PCR in the same manner as in 
Example 1. 

[Example 3] Expression of mouse cDNA clones 



(1) First PCR 

Ten types of cDNA clones optionally selected from mouse 



full-length cDNA library were used as a template. These are 
obtained by cloning the respective cDNAs in Sacl and Xhol sites 
of plasmid pBluescriptSK+ , and have been registered in GenBank 
(Accession Nos . are ml6206 / m21532, X13605, u51204, 116904, 
5 S68022, d87663, x65627, m32599 and u85511 respectively) . PCR 
was performed using the common primers in Example 1 as 5 1 primer 
1 and 3 1 primer and primers specific for the respective cDNAs 
and having the following nucleotide sequences as 5 1 primer 2. 
The composition of the PCR solution and the program were shown 
10 in Tables 6 and 2 respectively. 

5' primer 2: plA2 (SEQ ID No. 15) 

5 ' - GAAGGAGCCGCCACCATGCTCAAAGTCACGGTGCCC - 3 ■ 
5' primer 2: plB2 (SEQ ID No. 16) 

5 1 - GAAGGAGCCGCCACCATGGAGGAGCAGCGCTGTTC - 3 1 
15 5' primer 2: plC8 (SEQ ID No. 17) 

5 1 - GAAGGAGCCGCCACCATGGCCCGAACCAAGCAGAC - 3 1 
5' primer 2: plD2 (SEQ ID No. 18) 

5 1 - GAAGGAGCCGCCACCATGGGTGTTGACAAAATCATTCC - 3 1 
5' primer 2: plD9 (SEQ ID No. 19) 
2 0 5 ' - GAAGGAGCCGCCACCATGTTGGAGACCTACAGCAACC - 3 1 

5' primer 2: PldlO (SEQ ID No. 20) 

5 ' - GAAGGAGCCGCCACCATGGCGGTGCAGGTGGTGC - 3 * 
5' primer 2: plE4 (SEQ ID No. 21) 

5 ' - GAAGGAGCCGCCACCATGGATGATCGGGAGGATCTG - 3 ' 
25 5 ' primer 2: P1G4 (SEQ ID No. 22) 

5 ' - GAAGGAGCCGCCACCATGTCGAGTTATTCTAGTGAC - 3 1 
5' primer 2: plHl (SEQ ID No. 23) 

5 ' - GAAGGAGCCGCCACCATGGTGAAGGTCGGTGTGAAC - 3 1 
5' primer 2: plH5 (SEQ ID No. 24) 
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5 ' -GAAGGAGCCGCCACCATGGCCAACAGTGAGCG-3 1 



[Table 6] 



Composition 


Concentration 


Amount 


Final concentration 


Template plasmid 


0.2 ng/ul 


4 mI 


0.04 ng/Ml 


5' Primer 1 (hRBS>B+6) 


10 uM 


1 Ml 


0.5 mM 


5' Primer 2 


0.1 uM 


1 |ul 


0.005 mM 


3' Primer (p8.2) 


10 uM 


1 Ml 


0.5 mM 


dNTPs (Toyobo) 


2 mM 


2 Ml 


0.2 mM 


Expand HF buffer (Boehringer Mannheim) 


(10 x) 


2 Ml 


(1x) 


(containing 15 mM magnesium chloride) 








Sterile distilled water 




8.85 Ml 




DNA polymerase (Boehringer Mannheim) 


3.5 U/iul 


0.15 Ml 


0.02625 U/mI 


Total amount 




20 Ml 




(2) Second PCR 








5 Second PCR was performed in the same manner as in Example 


1. However, regarding the 


composition of 


the PCR 


solution, 


a reaction solution of a composition shown in Table 7 was used 


instead of that shown in Table 3 . 






[Table 7] 








Composition 


Concentration 


Amount 


Final concentration 


First PCR product (template) 


(xl/10) 


5 Ml 


(x 1/40) 


Universal primer (YA1 . 2) 


100 uM 


0.2 Ml 


1 mM 


5' Fragment (T7P fragment) 


2 nM 


0.5 Ml 


0.05 nM 


3' Fragment (T7T fragment) 


2 nM 


0.5 Ml 


0.05 nM 


dNTPs (Toyobo) 


2 mM 


2 Ml 


0.2 mM 


Expand HF buffer (Boehringer Mannheim) 


(10 x) 


2 Ml 


(1x) 


(containing 15 mM magnesium chloride) 








Sterile distilled water 




9.65 Ml 




DNA polymerase (Boehringer Mannheim) 


3.5 U/ul 


0.15 Ml 


0.02625 U/mI 


Total amount 




20 Ml 





10 

[Example 4] Confirmation of synthetic products 

The results of analyzing by agarose gel electrophoresis 
a part of the reaction solutions amplified by PCR according 
to the methods of Examples 1 and 2 were shown in Fig. 2. Lanes 
15 1 to 7 show the following respectively. 1: first PCR products, 
2: second PCR products with the addition of His tag, 3: second 
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PCR products with the addition of native His tag, 4: second 
PCR products with the addition of GST tag, 5: second PCR 
products with the addition of MBP tag, 6: second PCR products 
with the addition of CBD tag, and 7: second PCR products with 
5 the addition of TrxA tag. It is found that only a single DNA 
band is detected in both the first PCR and the second PCR and 
DNAs of correct lengths having bound thereto tag sequences 
coding for the respective tag peptides are amplified in the 
second PCR. 

10 The proteins synthesized in the cell-free synthesis 

system by the methods of Examples 1 and 2 were quantitatively 
determined using TOPCOUNT after synthesis in the presence of 
14 C labeled Leu. The results were shown in Table 8 below and 
Fig. 3. Table 8 showed a presumptive molecular weight and an 

15 expression amount of Ras protein and CAT protein synthesized 
as fusion proteins with 6 types of tag peptides. Results of 
synthesis without the addition of a DNA and results of synthesis 
in a cell-free protein synthesis system using plasmids pk7 -Ras 
and pk7-CAT (refer to Kigawa et ah, FEBS Lett. , vol. 442, pp. 

20 15-19, 1999) which are circular double- stranded DNAs were shown 
as a control. These results revealed that the expression 
amounts were approximately the same as those in using as a 
template the circular double- stranded DNA without these tag 
peptides, though they vary with the types of the tag peptides 

25 used. 
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[Table 8] 



Ras CAT 



Tag sequence 


Molecular weight 
(kDa) 


Expression amount 
(pg/ml) 


Molecular weight 
(kDa) 


Expression amount 
(pg/ml) 


His tag 


22.2 


45.3 


28.4 


99.1 


Native His 


20.7 


268.9 


29.5 


413.7 


GST 


42.6 


293.8 


52.8 


374.9 


MBP 


57.3 


663.1 


67.4 


684.8 


CBD 


37.8 


467.8 


44.0 


493.9 


TrxA 


28.7 


375.2 


38.9 


335.8 


No addition of DNA 




14.1 






pk7-Ras 




498.5 






pk7-CAT 






28.6 


478.4 



Further, the results obtained by performing SDS-PAGE 
using 5 jul of the solution resulting from the protein synthesis 
5 in the presence of 14 C- labeled Leu were shown in Fig. 4. In 
Fig. 4, lanes 1 to 7 show the following respectively. 1: Ras 
or CAT protein with the addition of His tag, 2: Ras or CAT 
protein with the addition of native His tag, 3: Ras or CAT 
protein with GST tag, 4: Ras or CAT protein with the addition 

10 of MBP tag, 5: Ras or CAT protein with the addition of CBD tag, 
and 6: Ras or CAT protein with the addition of TrxA tag. In 
the drawing, in lanes indicated as Ras and CAT, proteins 
synthesized in a cell-free protein synthesis system using 
plasmids pk7-Ras and pk7-CAT as a template are shown as a 

15 control, and H and L indicate a high-molecular-weight marker 
and a low-molecular-weight marker respectively. From these 
results, it is found that Ras and CAT proteins with the addition 
of the respective tags are synthesized with correct molecular 
weights . 

20 Synthesis amounts of proteins derived from mouse cDNA 

clones synthesized by fusion with GST and native His tag 
sequences according to the method in Example 3 were shown in 
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Fig. 5. These results revealed that proteins can be 
synthesized efficiently from cDNA clones of many samples. 
[Comparative Example 1] 

Two-step PCR was performed in the same manner as in 
5 Examples 1 and 2 by varying concentrations of the 5 1 DNA 
fragment (T7P (GST) or T7P (His tag)) and the 3' DNA fragment 
(T7T) used in the second PCR. All of the other conditions were 
the same as in Examples 1 and 2. The results of examining the 
second PCR products by 1% agarose gel electrophoresis were 
10 shown in Table 9. 



[Table 9] 



Sample 


1 


2 


3 


4 


5 


6 


7 


Concentrations of T7P and T7T in 
second PCR solution (pmol/L) 


1 


5 


50 


500 


2500 


5000 


10000 


Results of analysis by agarose gel electrophoresis 
Ras + GST tag - 


O 


O 


A 


A 


A 


Ras + His tag 




O 


O 


O 


O 


A 


A 


CAT + GST tag 




O 


O 


0 


A 


A 


A 


CAT + His tag 




O 


O 


O 


O 


A 


A 


no DNA + GST tag 










X 


X 


X 


no DNA + His tag 








X 


X 


X 


X 



The criteria of the agarose gel electrophoresis are as 
follows . 

15 - : No PCR product N 

O: Only correct PCR products are confirmed. 
A: Both correct PCR products and byproducts are 
confirmed . 

x : Only byproducts are confirmed. 
20 From these results, it is clear that when the respective 

concentrations of the 5* DNA fragment and the 3' DNA fragment 
in the second PCR solution are 1 pmol/L, the DNA amplification 
does not occur, while when the concentrations are 5, 000 pmol/L 
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or more, the amounts of byproducts are increased to decrease 
the purity of the template DNA. Accordingly, it is found that 
when the respective concentrations of the 5' DNA fragment and 
the 3 1 DNA fragment are within the range of from 5 to 2,500 
5 pmol/L, the amplification of the template DNA occurs 
efficiently. 
[Comparative Example 2] 

Two-step PCR was performed in the same manner as in 
Examples 1 and 2 by varying concentrations of primers used in 
10 the first PCR as follows. T7P (GST) and T7P (His tag) were 
used as the 5 1 DNA fragment in the second PCR. All of the other 
conditions were the same as in Examples 1 and 2. The results 
of examining the second PCR products by 1% agarose gel 
electrophoresis were shown in Table 10. 



15 [Table 10] 



Sample 


1 2 


3 


4 


5 


6 


7 


Concentration of primer in first PCR solution (nmol/L) 












hRBS>B+6 primer 


5 15 


40 


80 


230 


480 


980 


pRas or pCAT 


5 10 


10 


20 


20 


20 


20 


p8. 2 primer 


10 25 


50 


100 


250 


500 


1000 


Concentration of primer in second PCR solution (nmol/L) 












p8. 2 primer 


0.5 1 .25 


2.5 


5 


12.5 


25 


50 


Results of analysis by agarose gel electrophoresis 












Ras + GST tag 




O 


O 


O 


X 


X 


Ras + His tag 


O 


O 


O 


0 


A 


A 


CAT + GST tag 


O 


O 


O 


0 


X 


X 


CAT + His tag 


O 


O 


0 


o 


O 


A 


no DNA + GST tag 








X 


X 


X 


no DNA + His tag 








X 


X 


X 



The criteria of the agarose gel electrophoresis are as 
follows . 

- : No PCR product 
20 O: Only correct PCR products are confirmed. 
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A: Both correct PCR products and byproducts are 
confirmed. 

X: Only byproducts are confirmed. 

From these results, it is clear that in the production 
5 of the template DNA for expression of Ras and CAT, the amounts 
of byproducts are increased when the concentration of 3 1 
primer: p8 . 2 used in the first PCR as an index is 25 nmol/L 
or more and the amplification of the desired DNA is therefore 
not conducted. Even when the template DNA was not used as a 

10 control, the amplification of only byproducts corresponding 
to primer dimers was confirmed. Meanwhile, when the 
concentration of the primer used in the first PCR was 10 nmol/L 
or less, no efficient DNA amplification was confirmed in the 
second PCR. 

15 Industrial Applicabili ty 

According to the method of the present invention, the 
template DNA for synthesis, purification and detection of the 
protein in the cell-free protein synthesis system or the like 
can be produced rapidly and efficiently on the basis of the 

2 0 cDNA or genomic DNA comprising the sequence coding for the 
protein. This enables construction of an automated system for 
the structure and functional analysis of the protein used for 
study of the Structural Genomics. 
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